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Chapter 4 





Binomial Coefficients 


4.1 Binomial Coefficient Identities 
4.2 Binomial Inversion Operation 
4.3 Applications to Statistics 

4.4 The Catalan Recurrence 


Binomial coefficients are seemingly somehow involved 
with almost every combinatorial construction. 
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4.1 BINOMIAL COEFF IDENTITIES 


Table 4.1.1 Pascal’s triangle of binomial coefficients. 


1 1 
2 2 
3 3 
4 4 
5 5 
6 6 
7 7 
8 8 





REVIEW FROM 81.3 AND 81.5: We recall the following 
definitions and results. 


e Prop 1.3.1. The combination coefficients i satisfy 
Pascal’s recurrence: 


(3) a for all n > 0 left column 


(;) = 0 for all k > 1 top row 


Ce) = (aa) +(e’) eemet ts 


Section 4.1 Binomial Coeff Identities 3 


e The binomial coefficient 6,,;, is the coefficient of 
z® in the binomial expansion 


(1+a)” = Duss! 


e Prop 1.3.2. © The binomial coeffs satisfy Pascal’s 
recurrence. 


e Cor 1.3.3. For all k,n EN, 


rr 
= 0x 
(1) = os 


e Since combination coeffs have exactly the same values 
as binomial coeffs, as per Corollary 1.3.3, they are 
commonly referred to as binomial coefficients. 


e Prop 1.5.3. For all non-negative integers n and k, 


) = i (4.1.2a) 


e Cor 1.5.4. For all non-negative integers n and k, 


(;) = ao (4.1.2b) 
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Combinatorial vs. Algebraic Proofs 


TERMINOLOGY: An algebraic proof of an equation is 
achieved by transforming one side of the equation with the 
aid of substitutions and of arithmetic operations into the 
expression on the other side. 


TERMINOLOGY: A combinatorial proof of an equation 
is achieved by showing that both sides of the equation 
count the same thing. Sometimes such a proof uses the 
pigeonhole principle. 


Numerous examples of both kinds of proof follow. Some- 
times we give two or more proofs of a single assertion. 
Various general methods, including mathematical induc- 
tion, may be used with either type of proof. 


Symmetry 


Some identities are generalizations of properties read- 
ily noticeable in Pascal’s triangle. One such property is 
that each row of Pascal’s triangle is palindromic: it reads 
the same forward or backward. For instance, we observe 
the symmetry of row 8. 


1 8 28 56 70 56 28 8 I 


Proposition 4.1.1 [Row-Symmetry Property]. For 
any integers n and k such thatO <k <n, 


(G4) ts 
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Algebraic Proof: Using Eq. (4.1.2b) yields this easy 
algebraic proof. 


i) = mere > woe 7 (a) ¢ 


Combinatorial Proof: The left side of Eq. (4.1.3) is 
the # ways to select k objects from the set of n, to be 
in the designated subset. The right side is the # ways to 
select n — k objects to be excluded from it. There must 
be the same number of ways to do either. o; 


Row-Sum Property 


Another property of Pascal’s triangle is that the sum 
of the entries in each row is a power of 2. E.g., in row 8, 
1+8+ 28+ 564+ 70+ 56+ 284+ 841 
== 2562" 


Proposition 4.1.2 [Row-Sum Property]. The sum of 
the entries in row n of Pascal’s triangle is 2”. That is, 


- (;) = Qn (4.1.4) 


k=0 


Combinatorial Proof: According to Corollary 1.3.3, 
the summands on the left side are the number of ways to 
choose subsets of cardinality k from a set S of n objects, 
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for respective values of k. ‘Their total is the number of 
ways to select a subset from S, over all possible subset 
sizes, which is clearly 2”, shown on the right side, since 
each of the n objects is either present or absent. << 


Algebraic Proof: Substituting x = 1 into both sides of 
the equation for a binomial expansion yields the following 
result. 


(ska)? = a) x” 





k=0 
me A a > (i) : > (i) 
ce k=0 a9 k=0 





a) 


k=0 


Algebraic Proof: Another algebraic proof is by induc- 
tion, starting with row 0 of Pascal’s triangle as a basis 
case, and then using Pascal’s recursion to show that the 
sum in row n doubles the sum in row n — Ll. » 


Column-Sum Property 


Some other properties of Pascal’s triangle emerge af- 
ter further investigation. For instance, the sum of all the 
entries in any column, up to and including the entry in 
row n, can be found in the next column in row n+ 1. 
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Example 4.1.1: In columns 2 and 3 of Pascal’s triangle, 
we see the following configuration. 


; 














1+3+7+6+4+10+15 = 35 


10 
15 


Prop 4.1.3 [Column-Sum Property]. The sum of the 
entries in column c (c > 0) of Pascal’s A, from row 0 down 
to row n, equals the entry in rown+1, column c+l. Le., 


> (*) = ea, (4.1.5) 


k=0 


NOD Oo — CG bo 


Proof: By induction on the row number n. 


BASIS: For n = 0, the sum of the entries down to row 0 
is 1, in column c = 0, and is otherwise 0; also, 


( 1 ) _ 5 if c=0 
Cal 0 otherwise 


8 Chapter 4 Binomial Coefficients 


IND HYP: Assume for some n > 1 that 
Cc c+l 
IND STEP: Then 


e()-FH+0) 


— (. i ) + (" (induction hypothesis) 
1 
— Cs ') (Pascal’s recursjon) 


Diagonal-Sum Properties 


DEF: A diagonal from the upper left of a 2-dimensional 
array, toward the lower right, is called a southeast di- 
agonal, A diagonal in the opposite direction is called a 
northwest diagonal. 


DEF: A diagonal from the lower left of a 2-dimensional 
array, toward the upper right, is called a northeast di- 
agonal, A diagonal in the opposite direction is called a 
southwest diagonal. 


We observe that the sum of the elements along a fi- 
nite initial segment of the southeast diagonal in Pascal’s 
triangle appears just below the southeasternmost entry. 
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Example 4.1.2: Here is a southeast diagonal-sum. 











1 GG) 0 
nt 
0 1 2 3 4 
2 
3 3 
1+34+64104+15 = 35 
4 6 
5 10 
6) 15 
7 


Prop 4.1.4 [SE-Diagonal-Sum Property]. The sum 
of the first n+ 1 entries on the southeast diagonal from 
row r, column 0 in Pascal’s triangle equals the entry in 
rowr+n+1, column n, the entry immediately below the 
last entry of the diagonal. That is, 


= - a (4.1.6a) 


k=0 


Proof: ‘This result follows from two previously derived 
properties of Pascal’s triangle. 


ss (" : ‘) z » (" : ‘) anes 


k=0 
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(" +n+ ‘) ie is 
= column-sum property 
ral 
1 
— (" a : = ) (symmetry) » 


The following corollary simply reverses the order of 
summation of the elements on the diagonal. 


Cor 4.1.5 [NW-Diagonal-Sum Property]. For any 
non-negative integer m such that 0 <m <n, the binomial 
coefficients satisfy the equation 


: ee) (" r ‘) (4.1.6b) 


m 
Proof: Reversing the NW diagonal sum 


6-9) + Coe (9) 


on the left of the equation yields the SE diagonal sum 


Come CT ane 


which starts at row n — m and includes m+ 1 entries 
downward, ending at row n, column m. By Prop 4.1.4, 
the value of this southeast diagonal sum is the binomial 


coefficient : 
Tee 
m 
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The sums of northeast diagonals are Fibonacci num- 
bers. For instance, the sum 1+5+6+1 along the north- 
east diagonal that starts at @ is the Fibonacci number 


fr = 13. 


Example 4.1.3: The boxed Fibonacci numbers shown 
here do not actually appear at the locations shown. They 
are simply the sums along the northeast diagonals that 
lead to them. 





1 1+3+1 = [5] = & 


i+6+10+4 = i] = £ 
10 


1 
2 
3 
4A} 1 4 
Oo 
6 
7 


Prop 4.1.6 [NE-Diagonal Fibonacci Property]. The 
sum of the entries on the NE diagonal from row n, column 
0 in Pascal’s triangle equals the Fibonacci number fp+1. 


That is, 


oe) ig (4.1.7) 


k=0 
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Proof: BASIS: For n = 0 and n = 1 the northeast diag- 
onal sums are 1 = f; and 1+ 0 = fo, respectively. 


IND HYP: For n > 2 assume that 


Sepak CE fey, 
> ( ip ee and A i: ) = ha 


IND STEP: By the Pascal recursion, we have 


Wy be ore Lae 


Therefore, 


Reese et ee ee 


n—2 . n—1 
: nm—-j—-2 n—-k-1 
ae e Pe 
7=0 k=0 
Gc (induction hypothesis) 
= 7 (Fibonacci recursion )> 


Products of Binomial Coefficients 


Another pattern in Pascal’s triangle is the relation- 
ship between each element and the element to its upper 


left. 
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Example 4.1.4: We observe in this inset from Pascal’s 
triangle that 





3 

Ae i BS oa, RY ss 5 
5 AN oF 7 3 
6 20) or, equivalently 

7 6 
-4 = 140 = 

rf] 38 aronatt 
8 


The generality of this relationship, which is called the ab- 
sorption property, is established by the next proposition. 


Prop 4.1.7 [Absorption Property]. For0<k <n, 


(i) = nia] (4.1.8) 


Algebraic Proof: By algebraic manipulation, we have 
ke ke yee! =. 
Te gc Sg cae 
k k! (k —1)! (k —1)! k-1 
° 
Combinatorial Proof: Alternatively, we observe that 


the left side 
n 
k 
() 
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is the number of ways of choosing a board of k directors 
from a set of n persons and then a chairperson from within 
that board of k. This is clearly equivalent to the number 
of ways to choose a chairperson from a set of n persons 
and then another k — 1 persons from the remaining n — 1 
persons for the rest of the board of directors, which is the 


right side 
n—1 
(oa) ° 


Absorption is a special case of a relationship between 
an element and other elements along the northwest diag- 
onal direction. ‘This relationship is expressed by a highly 
useful combinatorial identity that generalizes the following 
illustration. 


Example 4.1.4, cont.: Observe that whereas at one 
position northwest of the coefficient @ we have 


6 7\ 42 4 
20 = = es Bhi 
(3) = a= 3 


at three positions northwest of ({) in Pascal’s triangle, we 


have 
ie AV  (7\ 42 | Ag 24 
SNe UN ee 210 
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3 3 
35.4 — 7 A [3! = : == dl 
A 73 4 7 fal 1 


or, equivalently 
Chey a, eB. (e 
39 4) |3 4—3) |3 


The following formulation of this property is called the 
subset-of-a-subset property. 





CON MH Ot 


Prop 4.1.8 [Subset-of-a-Subset Identity]. 
For0O<k<m<n, 


(ne) = (Care) 9) 


Algebraic Proof: By straightforward algebraic calcu- 
lation, we have 


a? ° Se 
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Combinatorial Proof: We can also reason combinato- 


rially that the left side 
n\ (fm 
mm) \k 


is the number of ways of choosing a board of m directors 
from a set of n persons and then an executive committee 
of k persons from within that board of m. This is clearly 
equivalent to the number of ways to choose an executive 
committee of k persons from a set of n persons and then 
another m — k persons from the remaining n — k persons 
for the rest of the board of directors, which is the right 


” (4 eae 


Vandermonde Convolution 


Thm 4.1.9 [Vandermonde Convolution]. Let n, m, 
and k be non-negative integers. Then 


EC) = (2) ane 


j=0 


Combinatorial Proof: A combinatorial proof supposes 
that there are n + m objects in a set, n of them blue and 
m of them red, and that k objects are to be chosen, for 


which there are clearly Ce) ways in all, the number of 
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the right side. ‘The number of ways to select 7 blue objects 
and k — 7 red objects is the product t) ae .); so the sum 


of all these products, which is on the left side, must be the 
same total as the right side. .o; 


Another Proof: The sum on the left of the combinato- 
rial equation above equals the coefficient of x” on the left 
side of the polynomial equation 


(l+2)"1+2)" = (L+2)°"™ 
and the binomial coefficient on the right side of the combi- 


natorial equation equals the coefficient of x” on the right 
side of that polynomial equation. o; 


Summary of Binomial Coeff Identities 


Table 4.1.2 Basic Binomial Coefficient Identities 


=] n—1l 
( Le ) + a) Pascal Rec (4.1.1) 


k 
= oa Falling Power Formula (4.1.2a) 


Gan 
=~ 3 
ee 
| 


ki(n—k)! 


(i) 
(;) See tee homie. (0008) 
(i) 


n 
= a Symmetry (4.1.3) 
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De (") cee Row-Sum (4.1.4) 
k=0 
(") = (’ : : Column-Sum (4.1.5) 
r=0 . ctl 
pa i — . ae ) SE Diagonal (4.1.6a) 
k n 
k=0 
m =a 1 
De (" ) — (" r ) NW Diagonal (4.1.6b) 
m—k m 
k=0 
fia k 
> ( k ) SS Fibonacci NE Diagonal (4.1.7) 
k=0 


n n—l 
(") — (i _ q Absorption (4.1.8) 


= (") . : 9 Subset-of-a-Subset (4.1.9) 


m—k 
‘ = cera Vandermonde Convo (4.1.10) 
gj} \Ek-3 k 


Parity of Binomial Coefficients 


"oN 
3 3 
We 
"oN 
x 3 
Wo fe 
| 


Beyond the basics of binomial coefficients, there are 
many fascinating byways. For instance, how might one 
determine the parity of a given binomial coefficient, such 


as 
165 
93 


( 
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without doing a lot of calculation? Scanning Pascal’s tri- 
angle enhances the mystery. One observes that all the 
entries in rows 1, 3, and 7, numbers of the form 2” — 1, 
are odd. Moreover, the number of odd numbers in a row 
appears to be a power of 2. Determination of the parity 
of a binomial coefficient was studied systematically by the 
British mathematician James Glaisher (1848-1928). 


Theorem 4.1.10. Let n and k be non-negative integers. 
Then 


0 mod 2 if n is even and k is odd 


Ct) = | (Ll) mot 2 otherwise 


Proof: ‘This proof splits naturally into four cases. 


Case 1 — n even andk odd: Since n is even, it is clear 
that, for this case, the value of the right side of the ab- 


sorption identity 
n n—1 
k — 


is even. Since the product k (7) on the left side must also 
be even, and since k is odd, it follows that Gs is even. 


Case 2 —n even andk even: For this case, we expand 
the binomial coefficient. 


k} 1-2-3---k 


) - n& — n(n—1)(n—2)---(n-k +1) 
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— (n-1)(n—-3)-:-(n-k4+1) n(n—2)(n—4)---(n-—k+2) 


16 Se5es(k= 1) eine eet 


Since the denominator has k/2 even factors, we continue 


— (n-1)(n—3)-:-(n-k4+1) n(n—2)(n—4)---(n-—k+2) 


1-3-5-+-(k-1) 28 1292 34.<8 


and, since the numerator has k/2 even factors, 


_ (n=1)(n— 3) (n= k+1) 2? - FG - DG - 2) G- FF) 


1-3-5---(k—-1) 23 .1-2.3-.-8 


2, eed) 3 ea hee 1) ao) 
1-3-5---(k—-1) k/2 


Therefore, 


n n/2 


1-3-5 (7) 2 (n= ln=3)--(n =k) 


It follows that for n and k both even, 
n n/2 |n/2| 
= = d 2 4.1.11 
(i) = (ie) = (ina) ee 


The first equivalence in (4.1.11) holds because each of the 
factors preceding the binomial coefficient in the numerator 
and in the denominator is odd, and multiplication of an 
integer by an odd number does not change its parity. The 
second holds because n/2 = [n/2| and k/2 = |k/2| for N 
and & both even. 
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Case 3 — n odd and k odd: As in Case 1, our starting 
point is the absorption identity 


n n—1l 
k = 
(i) a¢ - , 
Since n and k are both odd, and once again, since multi- 


plication of an integer by an odd number does not change 
the parity, it follows that 


(5) = (62) mes 


Since n — 1 and k — 1 are both even, it follows from Case 
2 that 
n—1 |[n/2| 
= d 2 
(i - , (ik) a 


and, hence, that 
(i) = (ity) moe? 


Case 4 — n odd and k even: ‘The symmetry identity 
implies that 


n—H)(7) = mH") and ieee = n("2") 


It follows from the absorption identity 


OL) =a) 
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n n—1 
—k = 
mf) = 0) 
Since n —k and n are both odd, we have 
n n—1l 
) = ( i ) mod 2 
Applying Case 2 to the right side, we obtain 
M\ = 2 mod 2 
k [A /2 


Since n is odd, the upper index [(n — 1)/2] equals |[n/2]. 
? 


that 


A simple algorithm to decide the parity of a binomial 
coefficient is to apply Theorem 4.1.10 iteratively, either 
until the upper index is even and the lower index odd or 
until the lower index is 0. 


Example 4.1.5: Here are both possible types of termi- 


nation. 
Al 2 
= : = 0 mod 2 
23 11 


(8) « (2) « 
(") = (7) = (2) = (0) = () sme 
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In order to see why the number of odd binary coefficients 
in a row of Pascal’s triangle is a power of 2, we observe 
that, in binary numbers, the integer operation 


n> [n/2| 


is achieved by erasing the rightmost bit. We observe also 
that Case 1 of Theorem 4.1.10, in which n is even and 
k is odd, is discernible by a 0-bit at the right end of the 
binary numeral for n and a 1-bit at the right end of the 
binary numeral for k. If the parity algorithm uses binary 
numerals, then iterative erasure of the rightmost bits is 
not actually necessary. It is possible, instead, to align both 
numerals flush right and to scan to see whether there is a 
0-bit above a 1-bit. 


Example 4.1.5, cont.: In scanning the aligned binary 
numerals 
16549 = 10100101. 
9319 = O1OLIIOLs 


leftward from the right end, the first occurrence of a 0 in 
the upper index occurs at the 2'-bit. Since there is also a 
0-bit immediately below it, the scan continues. ‘The next 
0 in the upper index occurs at the 2°-bit, and there is a 
1-bit below it, so the scan terminates and the decision is 
even parity. In scanning the aligned binary numerals 


7519 = 1001011, 
llig = 0001011, 


one observes that there is a 0-bit beneath every 0-bit in 
the upper index, so the decision is odd parity. 
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Proposition 4.1.11. The number of odd binomial coef- 
ficients in row n of Pascal’s triangle is 2", where w is the 
number of 1-bits in the binary representation of n. 


Proof: For the binomial coefficient CG) to be odd, there 
must be a 0 at each bit in the binary numeral for k for 
which there is a 0 at the corresponding bit of the binary 
numeral for n. However, if there is a 1 at a bit of the 
binary numeral for n, there may be either a 0 or al at the 
corresponding bit of the binary numeral for k. If there are 
w 1-bits for n, then there are 2” values for k that satisfy 
the rule for the 0-bits. > 


Corollary 4.1.12. If the integer n is of the form 2" — 1, 
then every binomial coefficient in row n of Pascal’s triangle 


is odd. 


Proof: ‘There are no 0-bits in the binary representation 
of 2” — 1. » 
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4.2 BINOMIAL INVERSION 


This section develops an incremental technique used 
with binomial coefficients, called binomial inversion. Its 
main application in this section is within a solution of the 
derangement recurrence. 


DEF: The transform of the sequence (fp) under bino- 
mial inversion is the sequence (g,,) with 


In = (ah (4.2.1) 


j=0 


A characteristic property of anything mathematical that 
is correctly called a duality operation is that a second 
application of the operation restores the original object. 
Theorem 4.2.1 confirms that a transformation called bino- 
mial inversion of sequences has this property. 


Theorem 4.2.1. Let (fr) be a sequence and (g,) its 
transform under binomial inversion. Then, for all n > 0, 


joe > (i)iva (4.2.2) 


In other words, retransformation restores the original se- 
quence (f;,). 
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Proof: Start at the right side of Eq. (4.2.2) and substi- 
tute the inversion formula of Eq. (4.2.1) for g;. 


3 (Jova= EG) » o (“is 


j=0 i=0 


EEC )Cvn as 


i ze 


Exchanging the order of summation is useful here. 


-EEC)(arn aes 


t=O) G1 


Applying subset-of-a-subset identity (Prop 4.1.8) reduces 


the # occurrences of the summation index 7. 
n—4 aa 
= Ee Gr evrs 
P= 
1=0 7=2 


Then factor to simplify the inner summation. 


SC EC arse er 
Substitute k= j —i. 
= > (")s > (2) 


Inner summation —> exponentiated binomial. 


1 
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i=0 
Using the Iverson truth function (¢ = n) leads to comple- 


tion of the proof. 
n a . 
Le (")s (t =n) 
i=0 


(") fn =n): 25, % 


nm 


Observe at Eq. (4.2.4) above that the summation index 
7 occurs twice in the summand within a binomial coeffi- 
cient, once as an upper index, and once as a lower index. 
In such circumstances, as seen here, the subset-of-a-subset 
identity often facilitates a transformation that reduces the 
number of occurrences of the summation index in the sum- 
mand. 


Some Basic Examples of Inversions 


The first three examples here of inversion are intro- 
ductory, to show how inversion works. 


Example 4.2.1: ‘The constant sequence 


Gr anak i Ae a 


has the inversion 
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= E(w 
= (l-«2)” a 

n 1: A= 0 
aed) NG ifn >0 


= 4G.) 2k 0-:0.-0 
More generally, the sequence 


a Se SO, WG. tr 


has the inversion 


(On) = ie: Oe “OO 


Example 4.2.2: The natural number sequence 


(fa) = 0 12 3 


is inverted as follows. 
In = » (") Ee 
Sj (") (—1) (4.2.5) 
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Apply the absorption identity to eliminate an occurrence 


of the index 7. 


S9(3 ey 


re Ga)ey 


j=0 


Substitute 7 =7+ 1 to align the binomial coefficient with 


the summation limits. 


| 
= 
3 
sl 
ke 
aN 
= 
~, | 
— 
n—__” 
| 
ea 
oh 
ke 





_) Jed as 
ev-1 |0 ifn#fl 
=> (gr) = 09 -1 0 0 


In Eq. (4.2.5) of this example, the summation index j oc- 
curs within a binomial coefficient and also as a multiplier. 
The absorption identity is the usual binomial identity by 
which the number of occurrences of the index variable is 
reduced in such a circumstance. 


The seq 0 1 2 3. ---+ can also be represented as 


((7)). Accordingly, it should be unsurprising if calculating 
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the inversion of the sequence is similar to Example 
AD? 


Example 4.2.3: ‘The binomial coefficient sequence 


i-() 


for a fixed non-negative number r has the inversion se- 
quence 


Gn. = » ("ows 


j=0 
-\ tc ) (") Cay (4.2.6) 
; r J 
j3=0 
Apply the subset-of-a-subset identity and then factor. 


EC)" 


j=0 


- (EGY 


Substitute 7 =7 +r. Then 


m= (JECT 


=r 


-QEC ar 
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EC en 


. iP 
0 ifnAzr 


At Eq. (4.2.6), the summand has two occurrences of the 
summation index 7. ‘This time, both are within different 
binomial coefficients, with one occurrence as an upper in- 
dex and the other as a lower index. ‘The subset-of-a-subset 
identity is frequently used to eliminate one of the occur- 
rences in such summands, thereby simplifying the sum. 


Derangements 


The point of learning how to invert sequences is not 
just to pose a new class of computational exercises. Bino- 
mial inversion has numerous extrinsic applications. 


Example 4.2.4: Every permutation of the integer in- 
terval [1 : n] can be obtained by choosing r numbers from 
[1 : n] and deranging them. Accordingly, if D; is a de- 
rangement number, then 


m= (cet Gre (ors (0) 


It follows that the sequence 


Fn — (=), 
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has the binomial inversion 


= 


Dey 


| 
LY] 
= 
Je. 
| 
ee 
Te 


Dy, a 1 1 1 ir lien 
Ge ay gh Ne) alae 


al MOO 
Thus, the proportion of derangements among the permu- 
tations of a set of n objects tends to e~! as n gets larger, a 
result that we previously derived with inclusion-exclusion 
in Example 3.6.5. This illustrates again our perspective 
that it is helpful to have a variety of mathematical tools 
available for the solution of a given problem. 


—l 
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More Examples of Inversions 


The summation techniques presented in this section 
for transforming sequences are widely applicable. The 
next section of this chapter applies these methods to com- 
putations in probability and statistics. We complete the 
present section with two more examples that combine the 
method of binomial inversion with the binomial identities 
derived previously. 


Example 4.2.5: When two factors of a summand are 
both binomial coefficients that contain the index of sum- 
mation as a lower index, the key to simplification is to set 
up an application of the Vandermonde convolution, which 
would simplify the summand. The sequence 


fe = (-1)"(") 


has as its binomial inversion the sequence 


In > (") (-1)’ fj 


= (")enew(*) 
-S(°() 


Apply the symmetry identity as a setup 
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vcd 
ad 
and then invoke the Vandermonde convolution. 
=, Fae? 
i n 
Example 4.2.6: Sometimes there is a quotient of two 


binomial coefficients both of which contain the index of 
summation. The sequence 


n=“) 


has as its transform under binomial inversion the sequence 


Gn 


| 
Me 
is 
Ns 
| 
ke 
es 
ot 


Here we apply the subset-of-a-subset identity 


CG) = GG) 


thereby obtaining 
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(') 
1A J 
oe d N\ (n / N—j 
n}\J n—-Jj 

oe oo 

1 ra n-J 
which can be simplified using the diagonal-sum identity 

N\(N+1 

n n 

N+1 
N-n+l1 
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4.3, APPLICATIONS TO STATISTICS 


Binomial coefficients frequently occur in sums that 
arise in probability and statistics. We continue to seek 
to reduce the number of occurrences of the index of sum- 
mation within the summand. There are a few additional 
rules of thumb to be learned here and used. 


Probability and Random Variables 


Some basic definitions are now recalled from elemen- 
tary statistics and probability. ‘The pace of the exposition 
here presumes that the reader has some prior familiarity 
with these topics. 


DEF: A discrete probability space is a pair (Q, Pr) as 
follows. 

e ‘The discrete set ( is called a sample space. 

e A subset of 2 is called an event. 


e The set 2® of all subsets of Q is called the event 
space. 


e The function Pr : 2° — R is called a probability 
measure, and it satisfies the following axioms. 


1.0 < Pr(A) < 1, for every event A C Q. The 
number Pr(A) is called the probability of the 
event A. 


21. 
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3. Ifthe events A;, for s € S, are mutually exclusive 
subsets of 2, then 


Prt LU 7 ee S_ Pr(As) 


secs secs 


DEF: A random variable X on a sample space is a real- 
valued function. It is called a discrete random variable 
if the set of values it takes is finite or countably infinite. 


NOTATION: Let X :Q — R be a discrete random variable 
on a sample space {2 with probability measure Pr. For 
x € R, the probability of the set {w € Q | X(w) = z} is 
denoted Pr(z). 


Mean and Variance 


The expected value of a random variable, also called 
the mean, is commonly described as a weighted average. 
The variance and the standard deviation are measures of 
dispersion from the mean. 


DEF: Let X : (2 > R be a discrete random variable on a 
sample space (2 with probability measure Pr, and let D 
be the set of values that X takes. The expected value 
or mean of the random variable X, denoted EX) or 
jtx, is the sum 


E(X) = px = >_> 2&-Pr(z) (4.3.1) 
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DEF: Let X : 02 > R be a discrete random variable on a 
sample space (2 with probability measure Pr, and let D 
be the set of values that X takes. The variance of the 


random variable X , denoted V(X) or o%, is the sum 


V(X) = 0% = i@—px)?-Pr(e) = BCX - xP) 
aa (4.3.2) 


DEF: Let X : (2 — R be a discrete random variable. The 
standard deviation of the random variable X, de- 
noted SD(X) or ox, is the square root of the variance. 


SD xX) Siox = -ae4 (4.3.3) 


NOTATION: When it is clear from context to which ran- 
dom variable X they pertain, the subscripts for mean and 
variance may be denoted ps and o°. 


DEF: In calculating the mean of a list of numbers or 
the variance of a list of numbers, one regards each 
element of the list as equally likely. 


Proposition 4.3.1. Let X :Q — R bea discrete random 
variable, T’hen 


C= POO) Sa (4.3.4) 


Proof: Let D be the set of values taken by X. We pro- 
ceed straightforwardly, starting from the Equation (4.3.2). 


o% = SD (e@-px)?-Pr(2) 


rED 
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S_ (2? — 2apx + wx) Pr(z) 


xrED 
= ye oe Pee -S— 2x px Pr(a) + ba pxyPr(x) 
rED xrED rED 
SO) Dae SC ePre) + px Pao) 
ED zED 
= E(X*)-2ux-pxtpx 1 
= E(X*)—px ° 


Binomial Distribution 


The prototypical experiment whose outcomes have a 
binomial distribution is a sequence of n tosses of a coin. 
Taking one of the possible outcomes of an individual toss, 
say heads, to be “success”, what is binomially distributed 
is the number of heads. We now apply the binomial coef- 
ficient identities of $4.1 to the calculation of the mean and 
variance of the binomial distribution. 


DEF: Given an experiment with binary outcome (success 
or failure) that is to be performed n times, the binomial 
random variable X is the number of successes. Suppose 
that the probability of success is p, and that the n trials 
are independent. Then 


pr(x=j) = (")Pa-p (4.3.5) 


The sample space is the sequence of outcomes of the n 
trials. An event is a set of possible outcomes. 
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Proposition 4.3.2. The expected value of a binomial 
random variable X on n trials, each with probability p 
of success, is 


E(X) = np 
Proof: Eq. (4.3.1) defines expected value. 


mr 


E(X) = Doj-Pr(X =3) 


j=0 
We substitute the probability of a binomial random vari- 
able, as given by Eq. (4.3.5). 


= oe, ) p(l—p)"? 


Absorption eliminates one of the four occurrences of the 


summation index 7. 


Yon(" ') Pap? 


j=0 
a g-—1 
j=0 
Substituting 7 = 7 — 1 yields the summation 
n—1 | 
= t 1 a n—1—2 
np ¥ ( ) vi P) 


that we recognize as a binomial expansion, and simplify. 
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= np[p+(1—p)]"~ 


Prop 4.3.3. The variance of a binomial random variable 
X on n trials, each with probability p of success, is 


V(X) = np(l—p) 
Proof: Once again, start at Eq. (4.3.1). 


B(X*) = Soi? Pr(X =3) 


= a) Py) eo? 


There are once again four occurrences of the index 7 of 


summation. Applying absorption reduces the exponent of 


7 in one occurrence, a reasonable step. 
mr 
; ||. : be 
Soin (41) aay 
: geal 
j=0 
~ n—1l 
= : J—1(q — p\"—I 
Pda) (oie (1—p) 


Substitute 2 = 7 —1 to align the indices of the binomial co- 
efficient with the upper and lower limits of the summation, 


another reasonable step. 
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np at +i) e 2 f pip 


Splitting the sum like this helps here 


n—1 n—1 
n—l1\ ; n—1-i .(m-ly n—1—i 
nw ( ; ) r'a-p : + mp Di( . )r'a-p : 


because the summation in the first part is recognizable as 


a binomial expansion. 


n—1 
.(n—-l\ , n—1-i 
rn — 
pt wi ( )r'a P) 
~=0 
Applying absorption again now eliminates one occurrence 
of the summation index. 
~ n—2 
n =| t t= n—1—2 
p t+ mp d_(n ("FD P) 
7=0 
Substituting k = 7-1 realigns the lower index of the 
binomial coefficient with the lower limit of the summation. 


16 fii? ed 
np + n(n 1)e? ("2") pea — pyre 
k=0 


np + n(n—1)p? = np + n*p* — np’ 


= np? + np(l—p) 


By Propositions 4.3.1 and 4.3.2, 
o% = E(X?) — B(x) 
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= [n*p? +np(1 —p)] — n°p* 
np(1 — p) » 


Unbiased Estimator of the Mean 


An intuitive statistical approach to estimating the 
proportion of persons in a population of large size N who 
have a given characteristic (such as enjoying recreational 
mathematics) is to take a random sample and to use the 
proportion in that sample to estimate the proportion in 
the general population. We will use binomial coefficient 
identities in confirming the validity of this approach. 


DEF: An estimator 6 of a statistical characteristic 0 of a 
population is said to be an unbiased estimator if the 
expected value E(@) for a random sample equals @. 


Proposition 4.3.4. The sample proportion is an unbi- 
ased estimator of the proportion of individuals in a general 
population that have a given characteristic. 


Proof: Suppose that in a population of size N exactly 
M individuals have a given trait. A sample of size n is 
taken. The random variables of interest are the number m 
of persons with that trait and the proportion 


of persons with the trait. The total number of ways to 
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choose a sample of size n is 

N 

n 


The number of ways that a sample of size n could have 
exactly 7 persons with the prescribed trait is the product 


Gna) 

j nj 

of the number of choices of 7 individuals from the popula- 
tion of size M with the trait and the number of choices of 


the remaining n — 7 individuals from the N — M persons 
who do not have the trait. ‘Thus, 


Gs) 
LA 7) = J 7 J 
Gh) 
Accordingly, 


B(X) = D2. Pr(m=s) 
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20) oe CC) 


Apply the absorption identity to eliminate one occurrence 


of the index of summation. 
eG Bee N—M 
— M.- 
al Poet 
wens = ‘) (“-") 
n\n = j—1 n-J 
Now use the Vandermonde convolution. 

_ M(N\7(N-1 

— onikn n—l 


Mont (N-1)=1 M 


n N2 (n-1)!!  £4N 


Thus, the intuitive method of estimating the mean is un- 


biased. » 


Unbiased Estimator of the Variance 


Let X be a random variable on a space (2. The iden- 
tically distributed random variables 


XY A use Ky 


are the values of X on n samples from (2, with sample 
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mean X. Statisticians use the estimator 
> 7 2 
ge 2X — DO. on a a 
n—l n—l 


with n — 1 in the denominator (rather than n), for the 
variance. This is explained by the next proposition. 


Proposition 4.3.5. The sample statistic 
. X; — X)? SOL) 
o2 a Doh ) os yy t nr Qe ) (4.3.6) 
n—-1 n—1 


is an unbiased estimator of the variance of the random 
variable X . 


Proof: 
a (EX) BOX 
ee ey 
= G0 Sy oH X;X;) 





a 


Split the double summation into two parts. 


. pT R A = weap 
oy wx). (7 #%) 


aera 
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7 (— - “ ee) 
Ew (G42) 





t= FS=L 
1 mr 
= <-> 0 B(x?) - 2 EUNDE(X) G49) 
n ¢ n(n — 1) 
11 t= 1:41 
Both sums are resolvable. 
1 —1 
= —-nE(X*) — ne Dee 
n n(n — 1) 
= (XA) = lx) 
= V(X) 


Thus, division by n — 1 leads to an unbiased estimate. > 
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4.4 THE CATALAN RECURRENCE 


REVIEW FROM 81.2: 


e The Catalan sequence {c,,} is defined by the recur- 


rence 
Gy Ss initial value 
Cn = CoCn—1 + C1Cn—2 T+ + + Cn—10€0 forn > 1 


Surprisingly, perhaps, the closed formula for c, is a mul- 
tiple of a binomial coefficient. 


Binary Trees 


In graph theory, the set 7, of binary trees can be 
defined recursively: 


e The empty tree © is in the set 7,. 


e The tree K? with a single vertex designated as the 
root is in the set 7,. 


e If 7’ € 7,, and if v is a vertex of the tree T’, then each 
of the following rooted trees is in the set 7,. 


i. ‘The tree obtained by adjoining a new vertex to 
v, called the left-child of vertex v. (A vertex has 
at most one left-child.) 


ii. ‘The tree obtained by adjoining a new vertex to 
v, called the right-child of vertex v. (A vertex 
has at most one right-child.) 
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Figure 4.4.1 illustrates the binary trees with 0, 1, 2, and 
3 vertices. It is easy enough to verify for these small cases 
that the Catalan number c, is the number of binary trees 
with n vertices. 


Co =1 @m {empty tree} 
Cy —1 O 
Oo Q 
Co= 2 / \ 
@ @ 
O O Q Q Q 
/ / / \ \ \ 

ea = g « e @ ? ® 

/ \ / 

e @ & @ 


Fig 4.4.1 The smallest binary trees. 


Remark: In computer science, each child of a vertex of a 
binary tree is designated either as a left-child or a right- 
child, even if there is only one child. The importance 
of this designation occurs in applications such as binary 
search trees and priority trees (see [GrYe2006]). 


DEF: The left subtree of a binary tree J’ is the subtree 
whose root is the left-child of the root of T. The right 
subtree of a binary tree J’ is the subtree whose root is 
the right-child of the root of 7’. 
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Proposition 4.4.1. For n > 0, the number of n-vertex 
binary trees equals the Catalan number cy. 


Proof: By induction on the number n of vertices. 


BASIS: Clearly, co = 1 and c; = 1 are the numbers of 
binary trees with 0 and 1 vertices, respectively. 


IND HYP: Let n > 0. Suppose for all integers k with 
O<k<n, that c, is the number of binary trees with k 
vertices. 


IND STEP: Suppose that a binary tree has n vertices. For 
k = 0,1,...,n —1, the number of possible left subtrees 
with k vertices is cz, by the induction hypothesis. Of 
course, the right subtree would then have n—k—1 vertices, 
so that there would be a total number of n vertices within 
the union of the two subtrees and the root, as depicted in 
Figure 4.4.2. 


ss new root 
” ~N 


binary tree with Z ~ 
k vertices = binary tree with 
S co n-k-1 vertices 


Fig 4.4.2 Joining left and right subtrees to a root. 


The induction hypothesis also implies, therefore, that the 
number of possible right subtrees is c,_,_1. Hence, by the 
rule of product, there are cycpn—p_—1 n-vertex binary trees 
with k vertices in the left subtree. Accordingly, the total 
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number of n-vertex binary trees is given by the sum 


n—-Ll 


Cy, = S Ck Cn—k—-1 
k=0 
= CoCn—-1 + C1 €n—-2 + °°* T Cn—-1 0 >» 


Nested Parentheses 


The set P of well-nested strings of parentheses 
is defined recursively (as depicted in Figure 4.4.3 below): 


e The empty string A is in P. 
e If P;,P, © P, then the string (P;)P, is in P. That 


is, we insert the string P; inside a new pair and then 
juxtapose the string P, at the right. 


In listing the well-nested strings with 0, 1, 2, and 3 pairs 
of parentheses, the new pair specified by the recursion rule 
above is depicted by brackets. 


0 pairs A = empty string Hl 
lpair ||] Creel 
2 pairs []0); [0] = 2 


3 pairs [| J)Q, [1(Q),L0J0, LOO () cz = 5 


Proposition 4.4.2. Forn > 0, the number of well-nested 
strings of parentheses equals the Catalan number c,. 


Proof: ‘This proof follows the exact same lines as the 
proof of Proposition 4.4.1. ‘The new pair of parentheses 
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with well-nested substrings inside and outside in the re- 
cursive construction here corresponds to the new root with 
left and right binary subtrees there. o; 


Subdiagonal Paths 


DEF: A northeastward path or NE-path in the array 
[0 : n] x [0 : n] is a path whose directed edges are each one 
unit in length and lead northward or eastward. 


DEF: A subdiagonal path from (0,0) to (n,n) in [0 : 
n| x [0 : n] is a NE-path along which each point (z, y) 
satisfies the inequality x > y. 

The inequality in the definition means, as illustrated in 


Figure 4.4.3, that the path never crosses above the longest 
northeast diagonal. 


ee © © © @ (5,5) 





(0,0) eo © @ @ 
Fig 4.4.3 A subdiagonal path from (0,0) to (n,n). 
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Proposition 4.4.3. For n > 0, the number of subdiago- 
nal paths from (0,0) to (n,n) in the array [0 : n] x [0: n| 
equals the Catalan number c,. 


Proof: In every prefix of a well-nested string of paren- 
theses, the number of left parentheses is greater than or 
equal to the number of right parentheses, and the total 
number of left parentheses equals the total number of right 
parentheses. Both these facts are provable by an induc- 
tion on the length of the string. It follows that the well- 
nested strings of n pairs of parentheses are in bijective 
correspondence with the subdiagonal paths in the array 


[0 : nm] x [0: nl]. > 


Solving the Catalan Recurrence 


Of the many methods of solving the Catalan recur- 
rence, the one now presented, based on work of D. Andre 
in 1878, is probably the simplest to follow. 


Theorem 4.4.4. The Catalan recurrence 


coe = 4s initial value 


Ce. = CC ya C19 a OP a epee. Sor ne) 
has the solution 
1 2n 1 (2n)! 
Cn ss =O te 
n+1il\n n+1 nin! 
Proof: Consider the set Syg of all NE paths from (0,0 to 
(n,n) in [0: n] x [0: n]. Suppose that each step eastward 
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on an NE path is represented by the letter E and that 
each step northward is represented by the letter N. For 
instance, the path in Figure 4.4.3 is represented by the 
string 


BNEENENNEN 


This correspondence is evidently a bijection between the 
set Sng of NE-paths and the set of strings in E and N of 
length 2n with n occurrences of each letter. ‘The number 
of ways to choose the n locations for the N’s in such a 


string is 
2n 
n 


The bijection establishes that this is the total number of 
NE paths. By Proposition 4.4.3, the Catalan number cy, 
equals the number of subdiagonal paths in the array (0 : 
n| x [0 : n]. Our approach is to subtract from this total 
the number of strings that do not represent subdiagonal 
paths. 
Observe that a path is not subdiagonal if and only if, at 
some point, the number of northward steps has exceeded 
the number of eastward steps. Accordingly, the corre- 
sponding string 

5159..-89n 
would have a prefix in which the number of N’s exceeds 
the number of E’s. If 27 +1 is the smallest index at which 
this occurs, then the number of E’s in the prefix 


$152-+-S2j41 


is 7 and the number of N’s is 7 + 1. 
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It follows that in the suffix 


$274+252743+++52n 


there aren —Jj E’s and n— 7-1 N’s. Suppose that in the 
suffix, each E is replaced by an N and each N by an E. This 
is called a reflection of the subpath or a reflection of 
the substring. The resulting string has n—1 E’s and n+1 
N’s. It represents an NE path in [(0:n—1] x [0: n+ 1], 
and there is a bijection between the set of non-subdiagonal 
paths from (0,0) to (n,7) in [0 : n] x [0 : n] and the set of 
NE paths from(0, 0) to (n—1,n+1) in [0 : n—1]x[0: +1], 
whose cardinality is 
2n 
(a) 


Thus, the number of subdiagonal paths from (0,0) to 
(n,n) in [0: n] x [0: n] is 


a : ( 2n (2n)"*(n+1) | (2n)**n 





n n—-1 n! n! 
1 2n 
7 ail") Y 

1 /6 20 

E le 4.4.1: = - = — = 86, 

xample C3 (3) 1 
1/8 70 

Example 4.4.2: cy, = = S27 S714 
5\4 5 
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Generalized Binomial Theorem 


An alternative proof of the solution to the Catalan 
recurrence uses the generalized binomial theorem. 


NOTATION: The k* derivative of a function f(x) may be 
denoted f*), 


DEF: An analytic function is a function f(x) with an n® 


derivative for every n > 0. 


Thm 4.4.5 (Generalized Binomial Theorem). For 
any real number s, the exponentiated binomial (1 + x)° 
has the power series 


(1+<a)° s (|) a 


k=0 


git 52 2 gk I 
Bde ag tie By gee Spee ae + ++. 


Proof: For f(z) =(1+2)%, observe that 








f(0) = (t+e2)*) = = 8° 

f (0) = 6 Lata) = s-16 1 = gt 
x=0 

f (0) = s(1+2)* eae st 
x=0 





By induction, it can be proved that 
pO) ee 
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Recall that the Maclaurin series expansion* of an analytic 
function f(z) is 





So. Ff) 
ie ye ae ok 


k=0 


2 HOO) 50, FO) at 


0! l! 2! 


TNO} te es 








Thus, the substitution f{”) (0) = s™ yields the conclusion. 
? 


Example 4.4.3: In the solution of the Catalan recur- 
rence below, we use this generalized binomial expansion. 


Lag = s (?) (—4z)* 
( 


se 


* This is equivalent to the Taylor series expansion at © = 0. 
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Alternative Proof of the Catalan Formula 


An alternative proof of the solution 


i 2n 1 (2n)! 
Cn — — . 
n+il\n n+1 nin! 


to the Catalan recurrence provides a traditional illustra- 
tion of the power of the method of generating functions in 
solving recurrences. We define the generating function 


Cle) = See 
n=0 


and begin as in 82.2. 











Step la. Multiply both sides of the Catalan recursion by 


Oi. 


n—-Ll 


a ye Crepe (4.4.1) 
k=0 


Step 1b. Sum both sides of Eq. (4.4.1) over the same 
range of values, as large as possible. 


oO co n-l 
a ) ) Creu Ae (4.4.2) 
n=1 n=1 k=0 


Step 2. Replace the infinite sum on the left of Eq. (4.4.2) 
with a finite sum involving the generating function C(z). 


oo n-l 


C(z) —c9 = Sy eae” 


Exchange the order of summation. 
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OO OO 
Tr 
= ) ) CkCn—k—-12 


k=0 n=k+1 
= “Da! 32 eats" 
n=k+1 
Substitute 7 =n—k—-1. 
OO 
= ast Dez 
j=0 


k=0 


CO 


= 2 »; c,z* C(z) 
k=0 


= 20 (2) ys chez 


= 202): 
zC(z)* — C(z) +1 = 0 (4.4.3) 


Step 3. Solve for C(z) in Eq. (4.4.3) by the quadratic 


formula. 
ltvJ1—4z 


oA) = 22 


(4.4.4) 


Step 4. ‘To solve for the value of the general Catalan 
number c,, we apply the Generalized Binomial ‘Theorem, 
as in Example 4.4.3, to Eq. (4.4.4). 


n=O 





k-1 
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1 1 
— ie Se NE BAD a a, 
1! 2! 
Since every term of this series except the first is signed 
negative, the appropriate choice is the negative root. That 


is, 


|S 


ces = = = — 2) (—42) 04.4.5) 


Nike 





To simplify (4.4.5), we expand part of the summand 


1 /l oe 1 1-1 -3 —(2n — 3) 
ni \2 nm. 2-2 2 2 
1 (a go eal . 
le oe a=) 
j=l 
np] 21 . 
a Re ea eD) 
n! oe : 2] 
j=l 


~1)"-1(2n — 2)! 
2 Sao (4.4.6) 


and we substitute the result (4.4.6) back into Eq. (4.4.5), 
to obtain 
ay are = 

C(2) = BY az 


2z ni 
n=l 


Nike 
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Shee Oy oi 
= eae 


spe (-1)"-1(2n — 2)! 


=—) ~~ — (-42)” 
2n-1(y — 1)\In! ( 
De a, 2 (n—1)In! 


Sih (Sl) On?) 
n=1 ane 
_ (2n — 2n — ee, 
7 = (n —1) ae ->5 n—-1 


— a 
are +1\n 


This yields the conclusion 








